This repository contains the full Anthromes-12k-DGG dataset in .csv format for all three uncertainty levels (baseline, lower and upper), as well as the an12_dgg_inputs shapefile, which holds the DGG polygons and supplemental data and can be joined to the .csv files for mapping. For the anthrome class names and example code for working with these data in R, please refer to the associated R analysis code and research compendium at https://doi.org/10.7910/DVN/6FWPZ9.


Raw and select intermediate data for the Anthromes 12k DGG (v1) analysis are provided for the sake of reproducibility, as the full analysis workflow is time-consuming and resource-intensive. If you only want to reproduce the analysis.rmd document from the accompanying R compendium, which runs the main analysis and visualizations, you can ignore the raw-data directory entirely and use the data in the derived-data directory instead, along with the an12_dgg_xxx.csv and an12_dgg_inputs.shp files in the main data release.

The raw-data directory contains the following files that are required to run the DGG_preparation.rmd script in the accompanying R compendium:
 - dgg_ids.csv, a csv of DGG cell ids to use as a land mask for all analyses.
 - HYDE.zip, zipped versions of the HYDE 3.2 input data (provided for convenience and reproducibility only; please cite the original HYDE 3.2 publication if you use these data).
 - supporting_5m_grids.zip, supporting 5 arc minute grids used as fixed inputs for the HYDE/Anthromes analysis.

Additional input files required for the Anthromes 12k DGG (v1) analysis:
 - three_conditions_v4_rep_data.csv, available at https://doi.org/10.7910/DVN/JNNK7B
 - WCMC_natural_modified_habitat_screening_layer/ and files therein, available at https://doi.org/10.34892/4Q5V-GF37

To rerun the DGG_preparation.rmd and anthrome_classification.rmd scripts from scratch, download these data, add them to the raw-data/ directory, and unzip HYDE.zip and supporting_5m_grids.zip.
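
As a convenience, the setup described above can be sketched in a few lines of Python using only the standard library. This is a minimal sketch, not part of the R compendium: the function names missing_inputs and unzip_raw_data are hypothetical, the raw-data/ path is assumed to be relative to the working directory, and extracting the archives in place (directly into raw-data/) is an assumption about the layout the R scripts expect.

```python
import zipfile
from pathlib import Path

# File and directory names are taken from the lists above.
REQUIRED_INPUTS = (
    "dgg_ids.csv",
    "HYDE.zip",
    "supporting_5m_grids.zip",
    "three_conditions_v4_rep_data.csv",
    "WCMC_natural_modified_habitat_screening_layer",
)
ARCHIVES = ("HYDE.zip", "supporting_5m_grids.zip")


def missing_inputs(raw_dir="raw-data", required=REQUIRED_INPUTS):
    """Return the names of required inputs not found in raw_dir."""
    raw_dir = Path(raw_dir)
    return [name for name in required if not (raw_dir / name).exists()]


def unzip_raw_data(raw_dir="raw-data", archives=ARCHIVES):
    """Extract each archive present in raw_dir into raw_dir.

    Assumption: the archives should be unpacked in place. Returns the
    names of archives that were not found and therefore not extracted.
    """
    raw_dir = Path(raw_dir)
    not_found = []
    for name in archives:
        archive = raw_dir / name
        if not archive.is_file():
            not_found.append(name)
            continue
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(raw_dir)
    return not_found
```

Calling missing_inputs() before unzip_raw_data() gives a quick check that all downloads are in place; both functions return an empty list when nothing is missing.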

The derived-data directory contains the following intermediate files required for the analysis.rmd script. These can be regenerated using the above raw input data, but are provided here for convenience:
 - contemp_vars.csv, contemporary biodiversity variables ready to join to the DGG shapefiles.
 - hyde_dgg, a compressed .Rds file storing the HYDE 3.2 data in DGG format for the baseline uncertainty level. It is required for calculating the HYDE 3.2 population time series in the main analysis.


